Statistical analysis of yeast genomic downstream sequences reveals putative polyadenylation signals.
نویسندگان
چکیده
The study of a few genes has permitted the identification of three elements that constitute a yeast polyadenyl-ation signal: the efficiency element (EE), the positioning element and the actual site for cleavage and poly-adenyl-ation. In this paper we perform an analysis of oligonucleotide composition on the sequences located downstream of the stop codon of all yeast genes. Several oligonucleotide families appear over-represented with a high significance (referred to herein as 'words'). The family with the highest over-representation includes the oligonucleotides shown experimentally to play a role as EEs. The word with the highest score is TATATA, followed, among others, by a series of single-nucleotide variants (TATGTA, TACATA, TAAATA.) and one-letter shifts (ATATAT). A position analysis reveals that those words have a high preference to be in 3' flanks of yeast genes and there they have a very uneven distribution, with a marked peak around 35 bp after the stop codon. Of the predicted ORFs, 85% show one or more of those sequences. Similar results were obtained using a data set of EST sequences. Other clusters of over-represented words are also detected, namely T- and A-rich signals. Using these results and previously known data we propose a general model for the 3' trailers of yeast mRNAs.
منابع مشابه
In silico Analysis of 3′-End-Processing Signals in Aspergillus oryzae Using Expressed Sequence Tags and Genomic Sequencing Data
To investigate 3'-end-processing signals in Aspergillus oryzae, we created a nucleotide sequence data set of the 3'-untranslated region (3' UTR) plus 100 nucleotides (nt) sequence downstream of the poly(A) site using A. oryzae expressed sequence tags and genomic sequencing data. This data set comprised 1065 sequences derived from 1042 unique genes. The average 3' UTR length in A. oryzae was 241...
متن کاملThe yeast FBP1 poly(A) signal functions in both orientations and overlaps with a gene promoter.
This report provides an analysis of a region of chromosome XII in which the FBP1 and YLR376c genes transcribe in the same direction. Our investigation indicates that the Saccharomyces cerevisiae FBP1 gene contains strong signals for polyadenylation and transcription termination in both orientations in vivo . A (TA)14 element plays a major role in directing polyadenylation in both orientations. ...
متن کاملDownstream elements of mammalian pre-mRNA polyadenylation signals: primary, secondary and higher-order structures.
Primary, secondary and higher-order structures of downstream elements of mammalian pre-mRNA polyadenylation signals [poly(A) signals] are re viewed. We have carried out a detailed analysis on our database of 244 human pre-mRNA poly(A) signals in order to characterize elements in their downstream regions. We suggest that the downstream region of the mammalian pre-mRNA poly(A) signal consists of ...
متن کاملApplication of a Naïve Bayes Classifier to Assign Polyadenylation Sites from 3' End Deep Sequencing Data: A Dissertation
Cleavage and polyadenylation of a precursor mRNA is important for transcription termination, mRNA stability, and regulation of gene expression. This process is directed by a multitude of protein factors and cis elements in the pre-mRNA sequence surrounding the cleavage and polyadenylation site. Importantly, the location of the cleavage and polyadenylation site helps define the 3’ untranslated r...
متن کاملStructure of the human sialophorin (CD43) gene. Identification of features atypical of genes encoding integral membrane proteins.
A human sialophorin (CD43) specific genomic clone was isolated, and a 6.5 kb fragment containing the 4.6 kb sialophorin gene was sequenced. The promoter region contains no TATA or CAAT boxes, but is highly enriched in G and C nucleotides and contains short repeat sequences similar to those found in the promoters of 'housekeeping' genes. S1-nuclease protection and primer-extension experiments es...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Nucleic acids research
دوره 28 4 شماره
صفحات -
تاریخ انتشار 2000